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Abstract. We propose an alternative to the Bak-Sneppen model for species sur- 
vival. In our model the number of species is random. Births and deaths occur with 
constant probabilities and it is only the species with lowest fitness which is removed. 
We show that there is a sharp phase transition when the birth probability is larger than 
the death probability. The set of species with fitness higher than a certain critical value 
approach an uniform distribution. On the other hand the species with fitness less than 
the critical disappear after a finite (random) time. 

1. Introduction. In the Bak-Sneppen model, see Bak and Sneppen (1993), there 
is a fixed number of species arranged in a circular graph. At each discrete time the 
site on the circle with the lowest fitness and its two nearest neighbors have their fitness 
replaced by a random number independently sampled from the uniform distribution on 
[0, 1]. There are a number of exciting results and conjectures for the Bak-Sneppen model, 
see Gillett, Meester and van der Wal (2006). However, from a biological point of view 
the model does seem a little artificial. We propose a model that seems more natural to 
us. In our model the number of species is random (instead of a fixed number) and we 
replace only the species with the lowest fitness (instead of replacing its nearest neighbors 
as well). 

Note that replacing the species with the lowest fitness involves a global interaction 
of all the species present. The Bak-Sneppen model involves this global interaction and 
a local interaction (replacing the neighbors) . As pointed out by Meester and Znamenski 
(2003) if only the lowest fitness were replaced then the Bak-Sneppen system would 
converge to a configuration with all fitnesses equal to 1 which makes the model not so 
interesting. Even though our model involves only the global interaction it has a non 
trivial limiting behavior. We now give the details of our model. 

Consider a discrete time model that starts from the empty set. At each time n > 1 
with probability p there is a birth of a new species and with probability q = 1 — p there 
is a death of a species (if the system is not already empty). Hence, the total number 
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of species at time n is a random walk on the positive integers which jumps to the right 
with probabihty p and to the left with probability q. When the random walk is at then 
it jumps to 1 with probability p or stays at with probability 1 — p- Each new species 
is associated with a random number. This random number is sampled from the uniform 
distribution on [0, 1]. We think of the random number associated with a given species as 
being the fitness of the species. These random numbers are independent of each other 
and of everything else. Every time there is a death event then the type that is killed is 
the one with the smallest fitness. This is similar to a model introduced by Liggett and 
Schinazi (2009) for a different question. 
Take p in (1/2,1) and let 



Note that fc is in (0, 1). Let L„ and Rn be the set of species alive at time n whose fitness 
is lower and higher than /g, respectively. Since each fitness appears at most once we can 
identify each species to its fitness and think of and Rn as sets of points in (0, fc) and 
{fc, 1), respectively. Let R'^ be the set of species that have appeared up to time n (they 
can be dead or alive at time n) and whose fitness is higher than fc- Clearly, at all times 
n, Rn is a subset of i?^. Let \A\ denote the cardinal of set A. We are now ready to state 
our main result. 

Theorem. For every e > we have almost surely for n large enough that 

< Kl - \Rn\ < -^n'/'+^. 

Pfc 

In particular, the set Rn of species whose fitness is above fc approaches a set of i.i.d. 
uniform random variables on (/c, 1). 

Observe that the number of species in R'^ is a random variable that follows a bino- 
mial distribution with parameters n and p{l — fc)- Moreover, R'^ is a (random) set of i.i.d. 
uniform random variables on (fc, 1). Hence, the set R^ of species whose fitness is higher 
than fc approaches an uniform system because the number of discrepancies between R^ 
and R'^ gets smaller and smaller compared to the number of species (which is of order n 
at time n). Interestingly, the same type of uniform behavior in some ( fc, 1) is expected 
for the Bak-Sneppen model but this is still unproved, see Meester and Znamenski (2003). 

We now discuss the behavior of L„ (the set of species whose fitness is lower than 
fc). Observe that |L„| (the cardinal of L„) increases by 1 with probability p/c, decreases 
by 1 with probability q (if it is not already at 0) and stays put with probability p{l — fc)- 
Since pfc — q, \L'n\ is symmetric and it is easy to check that it is null recurrent. That 
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is, \Ln\ will hit infinitely many times but the expected time of return to is infinite. 
So there will be times when |L„| is large but it will always eventually come back to 
0. Because we kill the species with lowest fitness the surviving species will tend to be 
concentrated close to fc- See figure 1. 
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Figure 1. This is the histogram of the fitnesses after 100,000 births and deaths for 
p = 2/3. We have fc = 1/2 and as predicted by the Theorem the distribution on {fc, 1) 
approaches an uniform. 

Observe that the larger p is the more welcoming the environment is to new species. 
If p is only slightly larger than 1/2 then fc is close to 1 and only species with high fitness 
will survive. On the other hand if p is close to 1 then fc is close to and even species 
with relatively low fitness will survive. 

2. Proof of the Theorem. 

The idea of the proof is quite simple. For times n for which (the set of species 
whose fitness is lower than fc) is not empty all deaths must occur in (and no species 
in Rn can die). Hence, at these times n, Rn and R'^ are coupled. Let tn be the number 
times k < n for which Lk is empty. That is, 

tn = \{1 < k < n : Lk = (lS}\. 

We will show that, for any e > 0, t„ is almost surely less n^/^"'"'^ for n large enough. 
Hence, the number of deaths in R^ is at most n^/^^"^. The main step in the proof is the 
following Lemma. 

Lemma. There are positive constants 7 and D such that for every e > we have 
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Once the Lemma is proved we are done. For we use the Borel-Cantelli Lemma to 
show that almost surely there is a natural N such that t„ < Cn^/^"''^ for n > N, where 
C = This shows that the number of discrepancies between Rn and R'^ is at most 

Proof of the Lemma. 

Recall that we start from the empty set. After a geometric random time with mean 
denoted by Gq, the first species appears in (0, /c). That is, 

Go = min{A; > 1 : 0}. 

Let 

El = mm{k > Gq : Lk = 0}. 

Hence, Ei be the time it takes starting at time Go for \L] to return to 0. More generally, 
we define for i > 1 

Gi = min{k >Go + Ei + ... + Gi_i + Ei:Lk^ 0}, 

and 

Ei+i = mm{k > Go + El + . . . + Ei + Gi : Lk = 0}. 

Note that the {Gi)i>o and the (i?i)i>i are two i.i.d. sequences. Moreover, the Gj 
follow a geometric distribution with mean 

Let kn be the number of times that L^ hits the empty set by time n: 

kn = \{2 < k < n : \Lk-i\ = 1 and \L).\ = 0}|. 

That is, kn counts the number of times L^ goes from 1 to species for k < n. Note that 
if A;n = then tn < Gq. We now compute 

(1) P{tn > Cn^/^+') < P{tn > Cn^/^+'; kn < n^/^+') + P{kn > n^/^+'). 

For fcn > 1 we have 

Go + Gi + . . . + Gfe„-i < tn < Go + Gi + . . . + Gfe„, 

and for fc^ = we have tn < Go- Hence, 

P{tn > Gni/2+^; kn < n^/^+') < P{Go + Gi + . . . + G^„ > Cn^/^+'), 
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where is the integer part of ■n}-!'^^^. Now, the expected value of Gq + G^i + • • • + G^m„ 
is "^"y^-*- . By a standard large deviations inequality (see for instance Lemma (9.4) in 
Chapter 1 of Durrett (1996)) there exists 7 > such that 

(2) P(Go + Gi + . . . + > Gni/2+^) < exp(-7m,) < exp(-7(ni/2+^ _ 1)). 

We now take care of the second term in the r.h.s. of (1). Observe that |L„| increases by 
1 with probability p/c, decreases by 1 with probability q (if it is not already at 0) and 
stays put with probability ]3(1 — /c). Since p/c = g, is essentially a simple symmetric 
random walk with a reflecting barrier at 0. In particular it is a recurrent Markov chain 
and it will come back to infinitely often. 

Now using that the Ei are i.i.d. and that for 1 < i < /c^ — 1 they all must be less 
than n, 

P{K > n^/2+e) < P{Ei < n)^"-^ 

It is easy to couple \Ln\ to a genuine simple symmetric random walk (one that jumps +1 
or -1 with probability 1/2 at each step) so that \Ln\ takes more time to go from 1 to 
than the genuine walk does. Let Tq be the time for a genuine simple symmetric random 
walk to hit 0. We have 

It is well known that Pi (Tq > n) is asymptotically 1 / •\/7rn/2, see for instance Chapter 
III in Feller (1968). Hence, there are constants 7' > and D such that 

(3) P{kn > ni/2+^) < exp(-y^^^^) < Dexp(-7V). 

Using (2) and (3) in (1) completes the proof of the Lemma and therefore of the Theorem. 
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